# Large Model Inference Optimization
Meta Llama 3.1 70B Instruct AWQ INT4
INT4-quantized version of Llama 3.1 70B Instruct, quantized with AutoAWQ and suited to multilingual dialogue.
Large Language Model
Transformers, Supports Multiple Languages

hugging-quants
80.59k
100
Mixtral 8x22B V0.1
Apache-2.0
Mixtral-8x22B is a pretrained generative sparse mixture-of-experts model that supports multiple languages.
Large Language Model
Transformers, Supports Multiple Languages

mistralai
1,032
220
Featured Recommended AI Models